Suffix Tree Clustering on Post-retrieval Documents

نویسندگان

  • Guihong Cao
  • Dawei Song
  • Peter Bruza
چکیده

Clustering is used to divide a collection of data into groups based on similarity of objects. With respect to IR, document clustering has been studied. An information retrieval (IR) system would always return a list of retrieved documents to the user. The post-retrieval documents can be clustered in order to help users browse and navigate the searching results. For this purpose, Zamir and Etzioni (1998) have proposed to use a suffix-tree based document clustering algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clustering of Web Search Results Using Semantic

Clustering is related to data mining for information retrieval. Relevant information is retrieved quickly while doing the clustering of documents. It organizes the documents into groups; each group contains the documents of similar type content. Different clustering algorithms are used for clustering the documents such as partitioned clustering (K-means Clustering) and Hierarchical Clustering (...

متن کامل

Auto-assemblage for Suffix Tree Clustering

Due to explosive growth of extracting the information from large repository of data, to get effective results, clustering is used. Clustering makes the searching efficient for better search results. Clustering is the process of grouping of similar type content. Document Clustering; organize the documents of similar type contents into groups. Partitioned and Hierarchical clustering algorithms ar...

متن کامل

Intelligent Support for Information Retrieval of Web Documents

The main goal of this research was to investigate the means of intelligent support for retrieval of web documents. We have proposed the architecture of the web tool system — Trillian, which discovers the interests of users without their interaction and uses them for autonomous searching of related web content. Discovered pages are suggested to the user. The discovery of user interests is based ...

متن کامل

Phrase based Clustering Scheme of Suffix Tree Document Clustering Model

Document clustering is one of the difficult and recent research fields in the search engine research. Most of the existing documents clustering techniques use a group of keywords from each document to cluster the documents. Document clustering arises from information retrieval domains, and “It finds grouping for a set of documents belonging to the same cluster are similar and documents belongs ...

متن کامل

A New Approach to Search Result Clustering and Labeling

A NEW APPROACH TO SEARCH RESULT CLUSTERING AND LABELING Anıl Türel M.S. in Computer Engineering Supervisor: Prof. Dr. Fazlı Can August, 2011 Search engines present query results as a long ordered list of web snippets divided into several pages. Post-processing of information retrieval results for easier access to the desired information is an important research problem. A post-processing techni...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003